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DETAILED ACTION 

Response to Arguments 

1 . Applicant's argument with respect to the use of Laurila et al. reference has been 
fully considered and is persuasive. Therefore, the rejection has been withdrawn. 
However, upon further consideration, a new ground(s) of rejection is made in view of 
Haverinen et al. (USPN 7197456). 

2. Applicant's argument regarding "the verification score of Sukkar is a likelihood 
score ratio of the sound associated with the subword hypothesis and the speech 
segment consisting of a different sound, rather than being obtained by inputting the 
normalized feature vectors to the verification-unit corresponded classifier" is however, 
not persuasive. One of ordinary skill in the art at the time of invention to readily realize 
that by incorporating the teaching of Haverinen et al. into Sukkar, the claimed invention 
would have been realized. Haverinen et al. produce a set of normalization parameters 
(means and standard deviation) for used to normalize the current input speech feature 
vectors, which would then be used by the system of Sukkar to perform further 
processing as indicated in figure 4 {feature vectors are received at step 322). 

3. The previous office action has been withdrawn in favor of a new non-final office 
action. 



Claim Rejections - 35 USC § 103 
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4. The following is a quotation of 35 U.S.C. 1 03(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

5. Claims 1,8-11, and 18-20 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Sukkar (USPN 6292778) in view of Haverinen et al. (USPN 
7197456). 

6. Regarding claim 1 and 1 1 , Sukkar discloses a method and system (figure 4) for 
utterance verification comprising the steps of: 

(A) extracting a sequence of feature vectors from an input speech (col. 8, line 60 
to col. 9, line 19 or referring to element 310 in figure 3); 

(B) inputting the sequence of feature vectors to a speech recognizer for obtaining 
at least one candidate string (speech recognizer 312 in figure 3 produces a number of 
recognition candidates and sub-word segmentation; also referring to col. 9, line 59 to 
col. 10, line 19); 

(C) segmenting the input speech into at least one speech segment according to 
the content of candidate string, which comprises individual recognition units, wherein 
each speech segment corresponds to a recognition unit and each recognition unit 
corresponds to a verification unit (speech recognizer 31 2 in figure 3 produces a number 
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of recognition candidates and sub-word segmentation 312; or also referring to col. 9, 
line 59 to col. 10, line 19); 

(E) utilizing a verification-unit corresponded classifier for each speech segment 
to calculate the verification score, where the sequence of verification feature vectors of 
the speech segment is used as the input of the classifier (col. 10, line 58 to col. 11, line 
32 or referring to the operation of figure 4); 

(F) combining the verification scores of all speech segments for obtaining an 
utterance verification score of the candidate string (col. 10, line 58 to col. 11, line 32 or 
referring to the operation of figure 4); and 

(G) comparing the utterance verification score of the candidate string with a 
predetermined threshold so as to accept the candidate string if the utterance verification 
score is larger than the predetermined threshold (col. 11, lines 1-32 or referring to the 
operation of figure 4). 

While the system of Sukkar inherently includes the step of normalizing extracted 
speech features for speech recognizer, Sukkar fails to explicitly disclose (D) generating 
a sequence of verification feature vectors for each speech segment according to the 
sequence of feature vectors of the speech segment, wherein the verification feature 
vectors are generated by normalizing the feature vectors using the normalization 
parameters of the verification unit corresponding to the speech segment, the 
normalization parameters of the verification unit are means and standard deviations of 
the feature vectors corresponding to the verification unit in training data, and these 
parameters are calculated in advance of runtime. 
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However, Haverinen et al. teach (D) generating a sequence of verification feature 
vectors for each speech segment according to the sequence of feature vectors of the 
speech segment, wherein the verification feature vectors are generated by normalizing 
the feature vectors using the normalization parameters of the verification unit 
corresponding to the speech segment, the normalization parameters of the verification 
unit are means and standard deviations of the feature vectors corresponding to the 
verification unit in training data, and these parameters are calculated in advance of 
runtime (co/. 6, lines 5-46; the standard deviation and mean of the test signal are pre- 
computed before runtime; They are used at runtime to normalize feature vectors of the 
test signal). 

Since Kukkar and Haverinen et al. are analogous art because they are from the 
same field of endeavor, it would have been obvious to one of ordinary skill in the art at 
the time of invention to modify Sukkar by incorporating the teaching of Haverinen et al. 
in order to improve speech recognition and verification accuracy. 

7. Regarding claims 8 and 18, Sukkar further discloses the method and system as 
claimed in claims 1 and 11, respectively, wherein in step (F), the utterance verification 
score of the candidate string is the mean of the verification scores of the speech 
segments in the input speech (col. 11, lines 5-15, averaging subword scores). 

8. Regarding claims 9 and 1 9, Sukkar further discloses the method and system as 
claimed in claims 1 and 11, respectively, wherein the input speech is corrupted by noise 
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(figure 1, speech is received via a telephone network, thus, a certain amount of noise 
must be added to the speech signal when received). 

9. Regarding claims 1 0 and 20, Sukkar further disclose the method and system as 
claimed in claims 6 and 16, wherein the speech segments used for training are 
corrupted by noise {speech recognizer 312 in figure 3 contains HMM models that are 
trained in advance; the speech used to train the speech recognizer 312 in advance 
inherently includes noise generated by either microphone of communication channel). 

1 0. Claims 3-7 and 1 3-1 7 are rejected under 35 U.S.C. 1 03(a) as being unpatentable 
over Sukkar (US 6292778) in view of Haverinen et al. (USPN 7197456), and further in 
view of Carey et al. (US 5526465). 

1 1 . Regarding claims 3 and 1 3, Sukkar fails to specifically disclose the method and 
system as claimed in claims 1 and 11, respectively, wherein in step (E), the classifier is 
a neural network, and the neural network is an multi-layer perceptron (MLP). However, 
Carey et al. teach that the classifier is a neural network, and the neural network is a 
MLP {col. 11, lines 1-20). 

Since Sukkar and Carey et al. are analogous art because they are from the same 
field of endeavor, it would have been obvious to one of ordinary skill in the art at the 
time of invention to modify Sukkar by incorporating the teaching of Carey et al. in order 
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to increase discrimination between world models and personal models to improve the 
verification process. 

12. Regarding claims 4 and 14, Sukkar further discloses that the verification score of 
a speech segment is the mean of the verification scores of the sequence of verification 
feature vectors corresponding to the speech segment {col. 11, lines 5-15, averaging 
subword scores). Sukkar, however, fails to specifically disclose wherein the MLP is 
used to calculate the verification score by inputting the verification feature vector and 
performing the feed-forward processing. However, Carey et al. teach wherein the MLP 
is used to calculate the verification score by inputting the verification feature vector and 
performing the feed-forward processing (col. 9, lines 43-50, forward and backward pass 
calculations). 

Since Sukkar and Carey et al. are analogous art because they are from the same 
field of endeavor, it would have been obvious to one of ordinary skill in the art at the 
time of invention to modify Sukkar by incorporating the teaching of Carey et al. in order 
to improve speech recognition accuracy. 

13. Regarding claims 5 and 15, Sukkar fails to specifically disclose the method and 
system as claimed in claims 3 and 13, respectively, wherein the MLP is trained by using 
an error back-propagation algorithm to reduce the mean square error between the 
verification score output of the MLP and the target value. However, CAREY ET AL. 
teach wherein the MLP is trained by using an error back-propagation algorithm to 
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reduce the mean square error between the verification score output of the MLP and the 
target value {col. 11, lines 1-20 and col. 12, lines 1-20). 

Since Sukkar and Carey et al. are analogous art because they are from the same 
field of endeavor, it would have been obvious to one of ordinary skill in the art at the 
time of invention to modify Sukkar by incorporating the teaching of Carey et al.. in order 
to increase discrimination between world models and personal models to improve the 
verification process. 

14. Regarding claims 6 and 16, Sukkar fails to specifically disclose the method and 
system as claimed in claims 5 and 15, respectively, wherein the MLP corresponding to 
the verification unit is trained by inputting the sequences of verification feature vectors 
of the speech segments corresponding to the verification unit and the sequences of 
verification feature vectors of the speech segments not corresponding to the verification 
unit. However, Carey et al. teach wherein the MLP corresponding to the verification unit 
is trained by inputting the sequences of verification feature vectors of the speech 
segments corresponding to the verification unit and the sequences of verification feature 
vectors of the speech segments not corresponding to the verification unit (col. 10, line 
43 to col. 11, line 34). 

Since Sukkar and Carey et al. are analogous art because they are from the same 
field of endeavor, it would have been obvious to one of ordinary skill in the art at the 
time of invention to modify Sukkar by incorporating the teaching of Carey et al. in order 
to train the speech recognizer to recognizer the speaker with high accuracy. 
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1 5. Regarding claims 7 and 1 7, Sukkar fails to specifically disclose the method and 
system as claimed in claims 6 and 16, wherein the target value is 1 if the speech 
segment corresponds to the verification unit and which is 0 if the speech segment does 
not correspond to the verification unit. However, Carey et al. teach wherein the target 
value is 1 if the speech segment corresponds to the verification unit and which is 0 if the 
speech segment does not correspond to the verification unit (col. 11, line 60 to col. 12, 
line 7, P p =0 and P w =1, that means the speaker is not verified; and P p =1 and P w =0 
means the speaker is verified). 

Since Sukkar and Carey et al. are analogous art because they are from the same 
field of endeavor, it would have been obvious to one of ordinary skill in the art at the 
time of invention to modify Sukkar by incorporating the teaching of Carey et al. in order 
to verify speaker with high accuracy. 

Conclusion 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to HUYEN X. VO whose telephone number is (571)272- 
7631 . The examiner can normally be reached on M-F, 9-5:30. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Patrick Edouard can be reached on 571-272-7603. The fax phone number 
for the organization where this application or proceeding is assigned is 571-273-8300. 
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Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 

/HuyenXVo/ 9/1/2008 
Primary Examiner, Art Unit 2626 



